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Abstract 

Algorithmic tools for graphs of small treewidth are used to address questions in com- 
plexity theory. For both arithmetic and Boolean circuits, it is shown that any circuit of size 
n*^'^^^ and treewidth 0(log* n) can be simulated by a circuit of width 0{\og^~^^ n) and size 
n'^, where c — 0(1), if i = 0, and c = O(loglogrt) otherwise. For our main construction, we 
prove that multiplicatively disjoint arithmetic circuits of size n'^^^^ and treewidth k can be 
simulated by bounded fan-in arithmetic formulas of depth 0{k^ logn). From this we derive 
the analogous statement for syntactically multilinear arithmetic circuits, which strengthens 
the central theorem of [MR08] . As another application, we derive that constant width arith- 
metic circuits of size n'-"-^'> can be balanced to depth O(logrt), provided certain restrictions 
are made on the use of iterated multiplication. Also from our main construction, we derive 
that Boolean bounded fan-in circuits of size 12'-"-^^ and treewidth k can be simulated by 
bounded fan-in formulas of depth 0{k^ logrt). This strengthens in the non-uniform setting 
the known inclusion that SC*' C NC^. Finally, we apply our construction to show that 
REACHABILITY for directed graphs of bounded treewidth is in LogDCFL. 

1 Introduction 

It is well-known that many hard graph theoretical problems become tractable when restricted 
to graphs of bounded treewidth^. If a graph with n nodes has bounded treewidth, there always 
exists a balanced tree decomposition of depth O(logn). This yields NC-algorithms for many 
problems, which are known to be NP-complete in general [Bod89]. 

Consider the following question. Suppose one is given a circuit (Boolean or arithmetic) of 
size s and bounded fan-in, for which the underlying graph has bounded treewidth. Does this 
imply, as intuition might suggest, there must exist an equivalent bounded fan-in circuit of size 
poly{s) and depth O(logs) ? We show that in the Boolean case the situation is as expected, 
which yields the following theorem: 



Theorem 1.1 The class of languages accepted by non-uniform constant fan-in circuits of poly- 
nomial size and bounded treewidth equals non-uniform NC^. 

Due to a celebrated result of Barrington [Bar86], it is known that NC^ can be simulated 
by constant width branching programs, which are skew circuits of constant width. A con- 
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^For a definition see Section 2. 
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stant width circuit can be evaluated using 0(1) memory, and hence SC° = NC^ (SC° is the 
class of Boolean functions computable by constant width circuits of poly size, c.f. [Mah07]). 
Theorem 1.1 strengthens this statement in the non-uniform setting. 

For arithmetic circuits, the short answer is that the equivalent circuit need not exist: a 
depth O(logs) circuit of bounded fan-in computes a polynomial of degree s^^^\ but using 
repeated multiplication a bounded trccwidth circuit of size s can easily compute a polynomial 
of degree 2^. We rephrase the question to avoid this triviality. 

The class of families of polynomials {pn}n>i of polynomial degree with polynomial size 
arithmetic circuits is known as VP (See e.g. [BCS97]). Let YF[tw = 0(log*n)] stand for the 
class corresponding to polynomial size circuits of trccwidth 0(log*n), and let VNC^ denote 
the class corresponding to bounded fan-in arithmetic circuits of depth O(logn). Our question 
becomes the following: is YP[tw = 0(1)] C VNC^ ? 

The above question remains to be wide open. Here, we show that multiplicaMvely disjoint^ 
circuits of size n*^^^^ and bounded treewidth can be simulated by bounded fan-in formulas of 
depth O(logn). In our notation this is stated as follows: 

Theorem 1.2 md-YP[tw = 0(1)] = VNC^ 

From this, we derive the analogous statement for syntactically multilinear circuits. The 
resulting formulas will be syntactically multilinear (denoted with the prefix sm-) as well. This 
implies the lower bounds by Raz [Raz04] hold all the way up to syntactically multilinear bounded 
treewidth circuits. We prove 

Theorem 1.3 sm-VP[tu; = 0(1)] = sm-VNC^ 

Above theorems shed some light on issues regarding bounded width circuits. One has 
the hierarchy of classes {VSC*}j>o, where VSC* corresponds to arithmetic circuits of width 
0(log*n) and size n*^^^\ for i > 0. We prove the following (and the Boolean analogue): 

Theorem 1.4 

1. VSC° C YP[tw = 0(1)] C VSC^ 

2. VSC* C YP[tw = O(log^n)] C YSC'+^size = nO(i°gi°g")], for any i > 1. 

Arithmetic circuit width is a fundamental notion, but it is still ill-understood in its relation 
to other resources. Recently, it has gained renewed attention from several researchers. It more 
or less embodies a notion of space in the arithmetic setting (See [MR09]). Mahajan and Rao 
[MR08] study the class YSC^[deg = 71*^(1)], which is obtained from VSC° by requiring formal 
degrees of circuits to be n'^^^\ Arvind, Joglckar, and Srinivasan [AJS09] give lower bounds for 
monotone arithmetic circuits of constant width. 

Theorem 1.3 strengthens the main result of Mahajan and Rao [MR08], which states that 
poly size syntactically multilinear circuits of constant width can be simulated by poly size 
circuits of log depth (but it was explicitly left open whether the latter could be ensured to be 
syntactically multilinear). Theorem 1.2 strengthens Theorem 4 in [JR09], which states that 
md-VSC° = VNC^ 

■^We indicate this with the prefix md-. See Section 2 for a definition. 
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In [MR08] the fundamental question is raised whether YSC'^ldeg = n'^(^)] C VNC^. As was 
mentioned, they show this holds under restriction of syntactic multilinearity. To make progress, 
we demonstrate a different restriction under which an efficient simulation by arithmetic O(logn) 
depth formulas is achievable. We apply Theorem 1.2 to give the following result for circuits 
with bounded iterated multiplication chains (For a definition see Section 4): 

Theorem 1.5 Constant width arithmetic circuits of size n'^^^^ with constant bounded iterated 
multiplication chains can be simulated by fan-in two arithmetic formulas of depth O(logn). 

As a final application, we consider the REACHABILITY problem. Given a (directed or 
undirected) graph G = {V,E) and s,t &V, this problem asks to test if t is reachable from s in 
G. REACHABILITY captures space bounded computation in a natural way. For directed graphs 
it is complete for NL [Jon75, Sav73]. The case of undirected graphs was settled recently by 
Reingold [Rei05] by giving a log-space algorithm for the problem. This shows the problem is 
complete for L. There has been extensive research aimed at settling the complexity of testing 
reachability on restricted graphs [ADR05, BTV07, KV09, ABC+06, AL98]. See [A1107] for more 
details. 

LogCFL and LogDCFL are the classes of languages that are logspace many-one reducible 
to non-deterministic and deterministic context-free languages, respectively. LogDCFL can be 
also characterized as the class of languages that can be recognized by a logspace Turing machine 
that is also provided with a stack, which runs in polynomial time. It follows by definition that 
L C LogDCFL C LogCFL and L C NL C LogCFL. However, it is unknown how NL and 
LogDCFL can be compared. In essense, this asks for a trade-off, trading non-determinism with 
stack access. Given that directed reachability is an NL-complete problem, giving a LogDCFL 
upper bound achieves such a trade-off for a restricted class of NL-computations. We show the 
following: 

Theorem 1.6 reachability for directed graphs of bounded treewidth is in LogDCFL, pro- 
vided the tree decomposition is given at the input. 

2 Preliminaries 

We briefly recall basic circuit definitions. A Boolean circuit is a directed acyclic graph, with 
labels {0, 1, xi, . . . , Xn, A, V, -i} on its nodes. Nodes with label from {0, 1, Xi, . . . , x„} are called 
input gates, and designated nodes of zero out-degree are called the output gates. The fan-in of 
a gate is its in-degree. Formulas are circuits for which the out-degree of each gate is at most 
one. We use the standard definitions of the size, width, depth and degree of a circuit. Fan-in 
is assumed to be bounded. The class NC^ is the class of boolean functions on n bits which 
can be computed by boolean circuits of depth O(logn) and size n^^^\ SC denotes the class of 
functions computed by polynomial size circuits of width 0(log* n). 

For arithmetic circuits over a ring R, nodes are labeled by ring constants, formal variables 
from a set X, and {-|-, x}. We assume fan-in is bounded by two. The output of an arithmetic 
circuit is a polynomial in the ring defined in the obvious way. The size of a circuit is taken 

to be the number of {+, x }-gates. For a circuit $ with designated output gate /, the polynomial 
computed by the output gate is denoted with [$] . We denote the set of variables using in $ 
by Var{^). Similarly we use Var{p), if p is a polynomial. Note that Far([$]) C Var{^). An 
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arithmetic circuit is called syntactically multilinear, if for each multiplication gate the subcircuits 
rooted at its inputs carry disjoint sets of variables. The formal degree of a circuit is defined 
inductively by taking variable and constant labeled gates to be of degree one. For addition 
gates one takes the maximum of the degrees of its inputs. For multiplication gates one takes 
the sum of the degrees. The degree of the circuit is taken to be the maximum degree of a gate. 

A p-family of polynomials is given by {frn}m>i, where /,„ G R[xi,X2, ■ ■ ■ , a^p(m)]) ^-i^d where 
deg(/r7i) < q{^)j for some polynomials p and q. Arithmetic circuit classes contain p-families. 
VP and VPe are the classes of p-families computable by arithmetic circuits and formulas, 
respectively, of size n'^^^^ (See e.g. [BCS97]). For i > 0, VSC* is the class of all p-families 
computable by arithmetic circuits of width 0(log* n) and size n^^^\ In [MR08] the class a-sSC* 
is considered, which corresponds to width 0(log* n) circuits of size and formal degree n^^^\ We 
will denote this class by YSC^ldeg = n*^^^)]. The class VNC* is the collection of all p-families 
computable by arithmetic circuits of depth 0(log*n) and size n^^^\ 

Next we define various graph parameters. The width of a layered graph is the maximum 
number of vertices in any particular layer. A tree decomposition of a graph G = (V, E) is given 
by a tuple (T, {Xd)(i£V[T])i where T is a tree, each is a subset of V called a bag, satisfying 
1) U(ieV^[T] Xd = V, 2) For each edge {u,v) G E, there exists tree node d with {u,v} C Xd, and 
3) For each vertex u ^ V, the set of tree nodes {d : u G Xd} forms a connected subtree of T. 
Equivalently, for any three vertices ti,t2,t2, G V[T] such that t2 lies in the path from ti to t^, 
it holds that Xt-, n Xt^ C Xt^. 

The width of the tree decomposition is defined as max^ \Xd\ — 1. The treewidth tw{G) 
of a graph G is the minimum width of a tree decomposition of G. For a rooted tree T, let 
X<t = UueStXu, where St = {u : u = t oi t is an ancestor of u}. 

Lemma 2.1 (Theorem 4-3 in [Bod89]) Let G = {V,E) be a graph with \V\ = n and treewidth 

a,t m,ost k. Then G has a tree decomposition (T, {Xd)deV[T]) of width 3k + 2 such that T is a 

binary tree of depth at most 2 [log 5 n] . 

4 

Proposition 2.2 A leveled graph G of width k has a treewidth at most 2k — 1. 

Proof. Let the levels of G be given by the sets of nodes Lq, Li, . . . , Lm, The required tree T 
is just a path ti,t2, ■ ■ ■ ,tm such that Vj Xt. = Lj^i U Lj. Clearly, for any v G the set of 

t G V\T] such that v ^ Xt form just an edge in T, and hence is connected. Every edge in G is 
covered by definition, and hence the this gives a valid tree decomposition of the graph G. 

3 Arithmetic Circuits of Bounded Treewidth 

The treewidth of a circuit with underlying graph G is defined to be tw(G). We introduce the 

class W[tw = 0(log* n)] as the class of p-families of polynomials {/„}„>! that can be computed 
by fan-in two arithmetic circuits of size rP^^'^ and treewidth 0(log*n). As Theorem 1.4 states, 
these classes interleave with the VSC* classes. We postpone the proof of Theorem 1.4 as it uses 
developments of our main construction. 

Theorem 3.1 For any multiplicatively disjoint arithmetic circuit $ of size s and treewidth k, 
there exists an equivalent formula T of size at most s^^^ ^ . 
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Proof. Let ^ he a multiplicatively disjoint circuit of size s, and let (T, (-^(i)deV[r]) be a tree 
decomposition of ^ of width k. By Lemma 2.1, we can assume that T is a rooted binary tree 
of depth d = O(logs). We first preprocess T and $ using Proposition 3.2. For a proof see 
Appendix A. 

Proposition 3.2 There exists a circuit of size at most 2s, for which [<I>] = \^'~\ , with tree 
decomposition (T', (-'^^)dGy[T']) of width at most k' = 3k + 2 and depth at most d, so that for 
any t G T' , for any non-input gate g & X[ with inputs gi and g2, either both gi,g2 G or both 
gi,g2 ^ X[. In the latter case it holds that gi ^ X'^^ iff 92 ^ ^<f 

We assume wlog. that $ and (T, {X(i)ii£VlT]) satisfy the conditions of Proposition 3.2, as 
the increase in k and s due to preprocessing does not affect the bound we are aiming for. For 
any tree node t E T and f e Xt, we define a circuit which is obtained from the subgraph 
$ [X<t] , by turning all g £ Xt that take both inputs from gates not in X<f into input gates with 
label Zg. For any / G Xt, let be the subcircuit of $t rooted at gate /. At most k + 1 new 
2;-variables will be used at the tree node t. Crucially, observe that, since $ is multiplicatively 
disjoint, any gate in computes a polynomial that is multilinear in z. 

We will process the tree decomposition going bottom up. At a node t, we want to compute 
for each / G a formula T^j equivalent to ^tj- Wlog. we assume the output gate of $ is 
contained in Xr, for the root r of T. Hence, when done, we have a formula equivalent to As 
will turn out, in order to keep the size of the computed formulas properly bounded, we require a 
constant bound on the number of appearances of a z-variable in Tf j. We achieve this by brute- 
force with Proposition 3.3, at the cost of blowing up the size by a factor of 2^^^. To verify its 
correctness, observe that the Ihs. and rhs. are multilinear polynomial in 2:2, . . . , ^fc+i] 

taking identical values on {0, 1}'^"^^, and hence must be identical. 

Proposition 3.3 Let f{x,zi,Z2,---,Zk+i) be a polynomial that is multilinear in z, then 
f{x, zi,Z2,..., Zk+i) = Ebe{o,i}fc+i (nie[fe+i](l - ^i)^"'"^') f{x, bi,b2, 

The recursive procedure for computing the desired formula equivalent to is given by 

Algorithm 1. Formally, for any t G T, and f Xt, let Tfj be the formula output by the 
procedure call Traceback{t, /). The following lemma proves its correctness: 

Lemma 3.4 For any t eT, and any f G Xt, \^t,f] = l^tj]- 

Proof. The proof will proceed by structural induction both on T and ^. The statement can 
be easily verfied for the two base cases: if t is a leaf of T, or / is an input gate in $. For the 
induction step, suppose t has children to and ti, and say f = fo ° fi, with o G {+, x}. We 
ignore line 17 of the procedure Traceback, since it does not modify the output of the computed 
formula. 

In case both /o,/i G Xt, by induction hypothesis, \Ttjo] = l^tjo] and \Ttj-,] = \^tji]- 
Observe that in this case Traceback{t, f) returns Ttj,) o F^jj, so \^tj~\ = I'^tjo ° Ti./il — 

Now assume not both /o,/i G Xt- By Proposition 3.2, this means /o ^ Xt and /i ^ Xt- 
Furthermore, we either have /o, /i G X<t, or {/q, /i} nX<t = 0. In the latter case, \^t,f^ = -2/, 
which is exactly what is returned by Traceback{t, f). In the former case, say /o G X<t^^ and 
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/i G -^<ti2i h,i2 £ {0,1}- Observe that by the tree decomposition properties / G Xt^_^, 
which makes the call of Traceback{t, f) on line 11 valid. Note that /q ^ X<ti2 ^ "^<*ii • 

Hence, by the tree decomposition properties, if ii 7^ ^2, there would exist a node t' with as 
ancestor such that /, /o G Xj/, but /i ^ Xj/. Due to Proposition 3.2 this case does not arise. 

The algorithm first computes T = Tracehack{ti^^ f). By the induction hypothesis 
[r] = In ^ti^j, whenever a gate g takes an input from a gate not in X<_t-^, i.e. 

by Proposition 3.2 this means both its inputs are not in X<t.^ , it appears as input node with 
label Zg. However, for the circuit node g roots <^t,g- Observe that this means that sub- 
stituting \^t,g\ for each Zg <E Var(\^t^^ f]) in f^'f,^,/] yields [$f./]. Observe that the tree 
decomposition properties give us that g E Xt, whenever we make the call on line 13 to com- 
pute r', and hence that this call is valid. By the induction hypothesis, [F'] = [$t,p]. Hence 
replacing, for all Zg € Var{\T~\), each gate in F labeled with Zg by the formula F' gives a new 
formula F satisfying [F] = \^tj] ■ □ 



We must bound the size of the formula F^ j. The idea is that since the number of z- variables 

in formulas is made to depend on k only, one has that Ttj is constructed from a{k) many copies 
of Tt^ g for children tj of t, for some function a(k) depending on k only. This implies a blow-up 
by a factor of a(k) to go up one level in T. In the actual analysis we find a{k) = 2°^^''^ More 
details can be found in Appendix B. 

Lemma 3.5 Let t G T be a node at height h, then for any f e Xt, Ft J has at most a'* 2*^+1 
many gates, where a = 2^'=^+^'=+^. 

Since T has depth O(logs), we conclude the final formulas given at the root of T will be 
of size s'^('^^). □ 

The proof of Theorem 1.2 is now clear. Trivially VPg C m.d-YP[tw = 0(1)]. The converse 
inclusion follows from Theorem 3.1. Now use the fact that VPe = VNC^ [Bre74]. 

3.1 Proof of Theorem 1.3 

Observe that sm- VNC^ C sm-VPe ^ sm-VP[tii) = 0(1)]- For the other direction, let $ be a 
syntactically multilinear circuit of treewidth k. We first modify it so that any gate g computing 
a field constant a is replaced by an input gate g' labeled with a. This can be done by removing 
edges fanning into g and relabeling. Hence the treewidth of the modified circuit is at most k. 
Next, any gate g labeled with a field constant a, with edges going to gates /i, /2, • • • , fm, is 
replaced by m separate copies of • • • , dm, each labeled with a, where we add edges {gi, fi), 

for all i e [m]. This does not increase the treewidth, as it can be thought of as a two step 
procedure, neither of which increases treewidth: first removing the vertex g and attached edges, 
secondly, adding back the isolated copies. Observe that now we have obtained an equivalent 
circuit that is multiplicatively disjoint. Namely, for purpose of contradiction, suppose there 
exists a multiplication gate / = /i x /2 such that both /i and /2 are reachable from some gate 
h. Then there exists such an h for which the paths to /i and /2 are edge disjoint. For this h, 
since is syntactically multilinear, there cannot be variables in the subcircuit Hence h 
is a gate computing a constant. Since the paths to /i and /2 are edge disjoint, h must have 
out-degree at least two. This contradicts the fact that any gate computing a constant in has 
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Algorithm 1 Recursive procedure for computing T^j 
1: procedure Traceback{t £ T, f £ Xt) 
2: if i is a leaf or / is an input gate in $ then 

3: return a formula equivalent to ^t,f of size at most 2^^+^ computed by 'brute force'. 
4: else 

5: let to cind ti be the children of t in T, and say / = /o o fi, with o G {+, x}. 

6: if both /o and /i are in Xt. then 

7: let r = Traceback{t, /o) o Traceback{t, fi). 

8: else 

9: // Neither /q nor /i is in Xt, by pre-processing. 

10: If /o and /i are not in X<t return a single node with label Zf. Otherwise, say 

/o e X<t^^ and /i € X<t.2 , for 11,12 £ {0, 1}. 
11: r = Traceback{ti^ , /) . 
12: for all Zg G Var{\T]) do 
13: let r' = Traceback{t, g). 

14: replace any gate in T labeled with Zg by the formula T' . 

15: end for 
16: end if 

17: Process r to make any z-variable occur at most 2^~^^ times using Proposition 3.3. 
18: return F. 
19: end if 



out degree one. The statement sm-VP[tu; = 0(1)] C VNCi now follows from Theorem 3.1 and 

the fact that VPe = VNC^ [Bre74]. 

To get the strengthened conclusion that sm-VP[itt; = 0(1)] C sm-VNCi, we will now 
indicate how to modify Algorithm 1 to ensure syntactic multilinearity. We use the notation of 
the proof of Theorem 3.1. Assume we have done preprocessing as indicated above. We know 
each circuit ^t,f is syntactically multilinear, for all t £T, and f £ Xt. The goal is to establish 
inductively that each Ttj is syntactically multilinear, for all t (zT, and f £ Xt. 

At the base case, i.e. line 3 of Algorithm 1, we can simply enforce the condition by brute 
force. At line 7, by induction Tfj^ and ^tji are syntactically multilinear. If o = +, then so 
is r. In case o = x, whenever the formulas Ttj^ and Fjj^ share a variable a, since we know 
[r] = is multilinear, a docs not appear in at least one of the polynomials [P^ j,,] and 

[Ptj^]. Setting a to zero in the corresponding formula ensures P is syntactically multilinear. 

We now argue how to correctly deal with the substitution on line 14, and the processing of z 
variables on line 17. Consider P as computed on line 11. We want to ensure it is in the following 
standard form: J2ae{0.i}''+'^ (llie[k+i] ^t''^ fa{x), for certain polynomials fa € ^[-^^J- For this 
we use the following modification of Proposition 3.3, which is obtained by multiplying out the 
factors njG[fc+i](l - Zi)^~'''z^i\ For a, a' G {0, l}'^'+\ we say a' < a iS {i : a'^ = 1} C {i : ai = 1}. 
We denote the size of {i : = 1} by |a'|. 

Proposition 3.6 Let /(x, zi, ^2, . . . , Zfc+i) be a polynomial that is multilinear in z. If we 
write f{x,zi,Z2,...,Zk+i) = Eae{o,i}'=+i (nie[fe+i] 4') coe/(/, z"^^' . . . ^^^en it holds 

that coefif, zl'zf . . . zl^^) = Ea'<a(-l)'"''/(^, 0!). 
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If we use the above proposition to process z-variablcs on line 17, then by induction, F 
on hne 11 will indeed have the required form, or for simplicity one can also assume we do an 
extra step of z-variable processing. That is, assume we apply above proposition to get F in the 
required form. This requires at most (2'^+^)^ copies of F and blows up F by an inconsequential 
factor of 2'-^^^\ Observe that this leaves F syntactically multilinear. 

Now consider line 14. First of all, any Zy G Var{T)\Var{\T^) can be set to zero in F. For 
the remaining z- variables, we claim that for any pair Zg,Zh G Far([F]), whenever Ff^g and F^^/j 
share a variable a, then coef{ [F] , m) = 0, for any multilinear monomial in the z- variables of F 
that contains l)oth Zg and Zh- Hence we can remove these terms from the standard form of F, 
and avoid multilinearity conflicts among products between each of the substituted formulas. 

We will verify this claim using the notion of a proof tree. A proof tree rooted at a gate g 
in a circuit C, is any tree obtained by recursively selecting gates, starting with g, as follows: 1) 
at an addition gate select exactly one of its children, and 2) at a multiplication gate select both 
children. We will consider proof trees of ^ti-^j rooted at /. For a subset Z of z-variables in 
^f.^j, we let PTree(Z) stand for the collection of proof trees rooted at / that have precisely the 
z- variables in Z appearing at its leaves. Given T G PTrcc(Z), let p{T) denote the product of all 
X variables appearing in T. The following proposition is easily proved by structural induction 
to the circuit f. 

Proposition 3.7 For any multilinear monomial m in z-variables used in ^ti^j, it holds that 
coe/([$t.^ j],m) = J2T&PTree{z)P('^)' "^here Z is the set of z-variables of m. 

Recall that by induction [F] = [<I>f,^j]. Now consider any multilinear monomial m in 
z-variables of j] with both Zg and Zh in it, where F^^^ and Tt^h share a variable a. For 
purpose of contradiction suppose coe/([$t.^ j],m) ^ 0. By Proposition 3.7 this means there 
exists a proof tree in rooted at / that contains both Zg and z^. This implies g and h are 

reachable from a single multiplication gate r in ^t^^ j, and hence also in ^tj- Observe that our 
construction satisfies the property that for any t € V[T] and / G Xt, Var{Ttj) C Var{^tj)- 
Hence a appears in both ^t,g and $t ft. Observe that both a's must be reachable from r in 
This contradicts the fact that ^tj is syntactically multilinear. 

Similarly, one can verify that whenever for a variables Zg G yar([F]), the formula Tt^g 
contains a variable a, then coe/([F],m) does not contain a for any monomial m containing Zg. 
Hence any occurrence of a in the formula X^(j/<„(— 1)I"'If(x, a') used to compute coe/([F],m) 
can be replaced by zero. 

We conclude that under above modifications. Algorithm 1 yields a syntactically multilinear 
formula Fjj equivalent to The proof is completed with the observation of [MR08] that 

Brent's construction [Bre74], which shows VPe C VNC''^, preserves syntactic multilinearity. □ 



3.2 Proof of Theorem 1.4 

The first inclusion of items 1. and 2. follows from Proposition 2.2. For the second inclusion of 
items 1. and 2, consider a circuit $ of size s having tree decomposition (T, {Xd)d£V[T]) of width 
k. By Lemma 2.1, we can assume that T is a rooted binary tree of depth d = O(logs), and we 
assume wlog. we have already applied Proposition 3.2 for preprocessing. For t & T and f E Xt, 
define ^tj as in the proof of Theorem 3.1. We will argue how to obtain small width circuits 
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^t,f satisfying \'i't,f~\ = \^t,f~\ ■ As before, we assume wlog. the output gate of $ is eontained 
in the bag at the root of T, so that when done a small width circuit computing [$] is known. 

For any leaf i G T and f G Xt, we trivially have an equivalent circuit for ^tj of size k + 1 
and width k + 1. 

Now consider t G T with children to Skud ti. Suppose a set of equivalent circuits iS" = 
{^to,g}geXt„^{^ti,g}geXt^ has already been computed for the set of circuits S = {^to,g}geXto U 
{^tug}geXt^- 

Let hi,h2, ■ ■ ■ , hk+i be the sequence of nodes in Xf as they appear in an arbitrarily selected 
topological sort of Let w be an upper bound on the width of any component in S' . For 
all i G [/c + 1], wc will build a circuit of width at most w + i — 1^ that has i many outputs 
carrying the values for all 1 < j < i. Wlog. we allow ourselves addition gates of fan-in 

one, which are used to pass through values. 

For i = 1, /ii takes both inputs from outside Xf. We observe that either is 
a z- variable, or some component in S' already computes [^J^t^fe^]. Namely, consider what 
Traceback{t,hi) does in order to compute [<I>f^/i^]. In this case, it either outputs a node 
with a z-label, or it first compute F = F^^ /^^, for some a G {0,1}, but there would be 
no substitution performed by Traceback{t, hi), for any Zg in F, since that would mean 
there exists a gate g E Xt such that hi is reachable from g. By the correctness of Algo- 
rithm 1, f$to,/iil = [r^^^/j^] = [F] = So we can take to be either a single input node 
labeled with a 2;-variables, or take it as some element of S'. In both cases the width is at most 
w. 

Inductively, for i > 1, suppose we have build the circuit ^i-i- It has outputs computing 
for 1 < J < i — 1. First consider the case that hi takes both inputs from within Xf. 
Similarly by inspection of what Algorithm 1 does for this case, it can be observed that "^tM^ 
can be build by adding or multiplying two components '^t,hp ^■nd ^t,hq with p,q < i. Hence 
we obtain \I'* by adding a single gate to vj/*-! and passing through all other outputs. Hence, in 
this case we have width{^'^) < max(u;idi/i(^*~^), 1 + i — l)<w + i — 1 

Finally, suppose hi takes an input not in Xf. We then know neither inputs are in Xt, 
since we assume we have applied Proposition 3.2. Let us consider Algorithm 1, to see what 
Traceback{t, hi) does to compute [<I*t,/i.]. Either a z-variable is returned, which will be easy to 
handle. Otherwise, it computes F with [F] = , for some G S. Then for z-variables in 
[F], values are substituted of form for g G Xt. Observe that hi is reachable from any 

such G in and hence g = hj for some j < i. 

From this we can conclude that the required either is a z-variablc, or can be build from 
one component 5' from S' and feeding outputs of ^'■^ for j < i into z- variable gates appearing in 
We do this by adding a copy of * below and passing alongside ^' the output values of 
This makes these values available so we can do the appropriate substitutions of 2- variables 
in Observe that this way has width at most max{width{'^'^~^) , width^"^) -\-i — l) < w-\-i — l. 

From the above, we observe that any 5'* will be at most a factor 0{k) larger than any 
component in S'. We conclude that the width increases additively by 0{k), and the size 
multiplicatively by 0{k), in order to go up one level in T. We conclude that a width 0{k\ogs) 
and size s<^('°sfe) bound holds for any ^'r-Ji with r being the root of T and / G Xr- □ 
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3.3 Evaluation over a Finite Field and Boolean Implications 

The observation is that Algorithm 1, when apphed over GF{2) to an arbitrary n-input arith- 
metic circuit will result in a formula T such that for any a G GF{2)'^, [$](o) = [r](a). 
For this, no assumptions regarding multiplicative disjointness of <1> is needed. One can prove 
this condition using structural induction similarly as in Lemma 3.4. For the processing of the 
z-variables on line 17, observe that we have the following adaption of Proposition 3.3: 

Proposition 3.8 Let f{xi,X2, ■ ■ ■ , Xn, zi, Z2, ■ ■ ■ , Zk+i) be a polynomial over GF{2). Define the 
polynomial g = E;,g{o,i}'=+i (nje[fc+i](l - Zif-^izl"^ f{xi,X2,...,Xn,biM,-- ■,h+i)- Then for 
any a G GF(2)"+'=+^ /(a) = g{a). 

One can generalize this to an arbitrary finite field F of size q = \F\, by similarly using 
brute force on line 17 of Algorithm 1 to make sure any z-variables appears at most q^~^^ times 
in r. Consequently, we have the following theorem: 

Theorem 3.9 Let F be a finite field, and let ^ = 1^1. For any arithmetic circuit $ over F of 
size s and treewidth k, there exists formula T over F of size at most such that $ and 

r evaluate to identical values for all inputs from F. 

For a proof of the following proposition see Appendix C. 

Proposition 3.10 For every Boolean circuit C of bounded fanin and treewidth k, there is an 
arithmetic circuit C over GF{2) of treewidth Sk such that Vx G {0, 1}", G{x) = 1 if and only 
ifC'{x) = l. 

We will now derive Theorem 1.1. Given a Boolean circuit of size s and treewidth k, first 
convert it into an arithmetic circuit over GF{2) using Proposition 3.10. Now apply Theorem 3.9 
to obtain an arithmetic formula F over GF{2) of size s^^^ \ Balance this formula down to depth 
O(fc^logs) using [Bre74]. Now do the reverse construction of arithmctization and code out an 
{a, V, -i}-formula computing the same function. The final circuit has depth 0{k^logs). Thus 
we have proven Theorem 1.1. 

We can use a similar reduction to derive a Boolean analogue of Theorem 1.4. The proof 
is contained in Appendix D. Let TWO* denote the class of Boolean functions computed by 
Boolean circuits of treewidth 0(log* n). 

Theorem 3.11 The following inclusions hold in the non-uniform setting 

1. SC° C TWC° C SC^ 

2. SC^ C TWC^ C SC'+\size = nOaogiog")], for alli>l. 

4 Constant Width Circuits 

Definition 4.1 An iterated multiplication chain of length m in a circuit $ is given by a se- 
quence of gates go,gi, ■ ■ ■ ,gm, where all are multiplication gates, except possibly go, such that 
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both inputs of gi are reachable from Qi-i, for all i G [m\. We denote the length of a longest 
iterated multiplication chain in $ by M{(^). 

Note that if M.{^) = 0, then $ is multiplicatively disjoint. 

Lemma 4.2 For any leveled arithmetic circuit $ of size s, width w, with fanout of every 
gate bounded by two, there exists equivalent multiplicatively disjoint circuit <J>' of size at most 

^^glt ~^ ~ -^^^ circuit has a tree decomposition (T, {Xt)t,zv[T]) of width at most 2w — 1, 
such that for any set L of nodes in $ on the same level, there exists t eT with L C. Xf. 

Proof. The lemma is proved by induction on d. Let the levels of ^ be given by sets of gates 
Lq, Li, ... , Lm, with edges going from to Lj+i only, for i = 0, 1, . . . , m — 1. The circuit $ has 
a tree decomposition (T, {Xt)f(zv[T] ) of width 2w — l, where T is given by the path t\,t2, ■ ■ ■ , t^n, 
and Xf. = Lj-i U Lj, for j G [m]. 

In case d = 0, we already have that ^ is multiplicatively disjoint, and we can conclude the 
lemma holds. For the induction case, suppose d > 0. 

We proceed by going from level r = m — 1 down to r = 0. At level r, let {g^)i^i be the set 
of gates in L^, such that for every i & I, g^ £ Lr is used by two gates g\,g2 & -^r+i such that 
from g\ and g2 there is an identical multiplication gate reachable. 

If / is non-empty, do the following: Create a copy of the subcircuit of $ that consists 
of the gates {g^)i^i, together with all the gates these depend on. Note that M.{^i) < d — 1. 
Inductively apply the lemma to make $/ multiplicatively disjoint. Modify by removing the 
edges (ff'j^l)' adding the edge from g^ in the copy of to (73, for all i £ I. 

Once all levels of $ have been processed in the above manner, it is clear the resulting circuit 
is multiplicatively disjoint. Now we bound the size of this circuit. By induction hypothesis, the 
copy of has size at most ^ — 1. In the worst case we create such a copy at each level, i.e. 

at most s many times . Hence the created circuit has size at most s + s • ( ^ ^ 1 ) = ^ ^-r^ ~ • 
Finally, we verify that the constructed circuit has treewidth at most 2w; — 1. Wc will check 
that the duplication performed at level r results in a new circuit with a tree decomposition 
satisfying all the required properties. By induction hypothesis, for the multiplicatively disjoint 
version of $7 we have tree decomposition (T', {Xt)t^v{T']) of width at most 2w — l. Furthermore, 
there exists t' G T' such that for all i £ I, the copy of 5* is in Xfi. We construct a tree 
decomposition {T" , (^t)tgv'[r"]) from T and T' as follows. First, create a new tree node t" with 
bag Xfii containing the copy of 3* and for all i G /. Next, let T" be the tree obtained by 
connecting tree node tr+i G T with t" , and connecting t" to t' G T' . The bag t" covers the 
edges from the copy of g^ to 52 ) fo^ ^ i £ I. Other edges are either covered in T or in T". 
The decomposition properties can now easily been seen to hold. Note that for any level Lr in 
$ there exists a bag in T" containing Lj.. Bag sizes in T" are bounded by 2u; — 1. □ 



Theorem 4.3 For any leveled arithmetic circuit $ of size s and width w, there exists equivalent 
formula T of depth d = 0{w^M{^) logs) and consequently size at most 2'^. 

Proof. Consider any leveled arithmetic circuit $ of size s and width w. The fanout of every 
gate in $ is bounded by w. We first modify $ to have the fanout of every gate bounded by two. 
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This is done by adding dummy 'add to zero' addition gates arranged in complete binary trees 
of depth at most [logti;]. The width of the resulting circuit will be 0{w'^), and its size 0{'ws). 
Next we apply Lemma 4.2 to obtain an equivalent circuit of size s"^^^^^^^ and treewidth 0{w^) 
that is multiplicatively disjoint. Now apply Theorem 3.1 to get an equivalent formula of size 
gO(w A^($)) Finally, balance this formula using Brent's construction [Bre74] down to depth 
0{w^M{^)logs). □ 

Theorem 1.5 immediate follows from Theorem 4.3. Note that conversely one has the 
inclusion VNC^ C VSC°[X = 0(1)], due to [BC88]. We remark that Theorem 4.3, to the best 
of our knowledge, when applied to an arbitrary circuit family {^n}n>i of size n'^^^\ constant 
width, and 7W(<&n) = o{logn), the resulting size formulas {r„}„>i provide the best- 

known upper bound for the size of equivalent formulas. For M.{^n) = 0{\ogn), one is dealing 
with the general case of the class YSC^[deg = n'~'^^^]. In this case our construction yields 
equivalent 0(log^ n)-depth formulas of size n'^(^°s") _ Formulas with such parameters can also 
be obtained using the construction in [VSBR83] . 

5 Testing Reachability in Bounded Treewidth Graphs 

The following proposition is proved in Appendix E: 

Proposition 5.1 Given a directed graph G = iy,E) of hounded treewidth and two vertices 
s,t , we can obtain a circuit C of hounded treewidth and an input x such that C(x) = 1 if 
and only if t is reachable from s in the graph G. 

For the proof of Theorem 1.6, we compose the recoding of the input of Proposition 5.1 and 

2 

Proposition 3.10, to get an arithmetic circuit C over GF{2) and x G {0, 1}"' , together with its 
tree decomposition, such that t is reachable from ,s in G iff C{x) = 1. This simple recoding 
of the input can be computed within logspace. Hence the theorem follows by the following 
observation: 

Proposition 5.2 Given an arithmetic circuit C over GF{2), its tree decomposition 
(r, {X(i)deV[T]) of constant width k, and an input x G {0, 1}", testing whether C{x) = 1 can he 
done in LogDCFL. 

Proofsketch. The proof proceeds by analyzing the algorithm Traceback. We are given the 
circuit C and an input x. We replace each gate of C labeled by xi with its Boolean value given 
as input. Next we run Traceback to compute an equivalent formula. A straigtforward analysis 
gives that this computation takes time polynomial in the length of the input. We claim that 
in addition wc can implement the algorithm using only O(Iogn) workspace, provided we use a 
stack (whose space usage is not counted towards the space bound). 

We implement the recursion by using the stack in the usual way. At any point in the 
recursion, the configuration can be represented with O(logn) space. Namely, by considering 
the source-code of Traceback^ any of the variables, with the exception of r,r', and Fjj, are 
pointers into either the graph or the tree decomposition and hence take O(logn) size. 

The observation is that for any of the equivalent formulas being held by r,r', and F^j, 
there are no x-variables appearing, since these are replaced by Boolean values. Traceback 
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ensures these formulas are represented as multilinear polynomials in z-variables over GF{2). 
For any such formula there are at most k + 1 different Zg's, since whenever Zg is a variables in 
Ttj, we always have g € Xt. Furthermore, each Zg appears at most 2^^^ times. Each Zg has 
associated a pointer to a node g in the graph. We conclude it takes 0{k2'''^^ \ogn) to represent 
any of F, F', and Fj j. This makes it possible to execute the algorithm in polynomial time on a 
Turing machine with a stack and O(logn) workspace. The final equivalent formula will simply 
be the value C{x) G {0, 1}. □ 
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A Proof of Proposition 3.2 

Let us show how to enforce the first condition. Suppose at some t G T we have a node g (z Xt 
with inputs gi and g2 with gi G Xf, but g2 ^ Xt. This is resolved by adding an addition gate 
a and an input gate bg^ labeled by zero to removing the edge {g2,9), and adding edges from 
b and 52 to a, and from a to g. T is modified by adding a to every bag containing g, and 
b is added to every bag containing (72- Repeat this procedure until all conflicts are resolved, 
where for any node 52 we reuse the node Zg^ with label zero, if it has already been introduced 
at a previous operation. Hence for any node g we add at most two addition gates and at most 
one gate bg to all bags containing g. Hence the width of the new tree decomposition will be 
at most 3k + 2. The depth will remain to be d. The second condition follows from the first. 
Namely, suppose gi G By the tree decomposition properties it must be that g and gi 

are contained in X^, for some descendant t'oft. Hence 52 € X^,. □ 
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B Proof of Lemma 3.5 



Proof. We use B{h) to denote a bound on the size of any Ttj, for a t at height h and / G X^. 
If t is a leaf, then we can give a formula for F^j of size at most 2*^+^, so we take -B(O) = 2*^+-'^. 
Now suppose height{t) > 0. Consider unfolding the recursive calls of Traceback{t, f), where 
we do not unfold calls of Traceback{t' , f), for children t' of t. Rather, for these we take the 
upper estimate that there we obtain a formula F of size at most B{h — 1) with at most k + 1 
many z-variables, each of which appears at most 2*^+^ many times in F. We can partition the 
calls in stages: Traceback{t, f) is at stage 0. Any recursive call made of Traceback at stage i 
remains at stage with the exception of calls of Traceback on line 14. These calls signal the 
next stage, i.e. are at stage i + In total the number of stages is bounded by + 1. 

For the first stage in the worst case first line 7 is executed up to recursion depth /c+1, before 
a call is made for a child of t. This gives a formula T of size at most (2'^"'"^ — \)+2^^^B{h—\) < 
2^+'^B{h — 1). It has at most 2^~^^ ■ {k + 1)2^+^ < 2^*^+^ many occurrences of z variables. 
For each of the z variables in we start unfolding second stage calls of Traceback. For 
each such call we have the same bounds 2^^'^B{h — 1) on the number of tree nodes, and 
23fc+2 £qj, £]^g number of 2:-variables it produces. This process continues up to stage at most 
k + 1. Since the z variables cause a branching of degree at most 2^*^"*"^, we can state that 
B{h) = {2^+^B(h - 1)) • 2(3^+2)('=+2) = (23fe'+9fe+6)s(/i _ 1). This proves the lemma. □ 



C Proof of Proposition 3.10 

The standard way to obtain C' is by replacing in C each A(/i,/2) by /i x /2, each -•(/i) by 
1 + /i, and each Va(/i, /2) by (/i +„ /2) +b (/i Xc /2)- In the latter, we labeled the gates to 
indicate which vertices arc reused. Wc introduce new vertices and edges only in the case of -i 
and V. Modify the tree decomposition of C, to get one for C, as follows: In the case of -■, it 
suffices to add the new isolated vertex labeled by 1 to one bag containing the -> gate. In the 
V-case it suffices to add newly created vertices b and c to all bags containing a. □ 



D Proof of Theorem 3.11 

The lower bound in (1) and (2) directly follows from Proposition 2.2. To argue the upper 
bounds, we convert the given TWC'-circuit into an arithmetic circuit <I> over GF{2) of bounded 
treewidth using Proposition 3.10. Now applying the construction from the proof of Theorem 1.4 
to will give us a circuit over GF{2) of width 0{\og^^^ n), and size n"^, where c = 0(1) if i = 0, 
and c = n'^^^'^s^osn) otherwise. Now construct the required Boolean SC*"^^-circuit by replacing 
each gate by a Boolean gadget computing the arithmetic operation over GF{2). □ 



E Proof of Proposition 5.1 

We describe the standard construction and then argue that it preserves treewidth. Given the 
graph G, the circuit C is obtained by placing an V gate at every node v € V. The root gate 
is the V gate placed at t. Now for every edge {u, v) e E place an A^^ of fanin 2 and fanout 1 
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which receives an input from the and feeds into V„. The second input of /\uv is the variable 
Xuv which is essentiahy the [u, v) entry of the adjacency matrix of G given at the input. The 
V gate placed at s will have an additional input which is assigned to a value 1. It is clear from 
the construction that this 1 propagates to the output gate if and only if there is a directed path 
from the node s to the node t. 

Now we need to argue that this construction preserves treewidth. We will show this 
constructively. Suppose we are given a tree decomposition (T, (-'^d)dey[T])) such that \/t \Xt\ < 
k. We will show that the circuit C obtained above also has a tree decomposition with < 
k + 2. The construction simply adds the vertex A„„ and the input node Xuv to the bag which 
contains the vertices correpsonding to and Vy. This covers all the new edges introduced, 
namely (V„, A„^), (A„^, V^), {xuv, /\uv)- The width has increased just by 2. □ 
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